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RICKETTSIAL DISEASES AKH MEmODS OF USE 

5 

This invention was made with gDvcnizne^ 
G-00-3030-00. The governmait has cotaia rights in Qdsin^ 

(;>pfs-Rcfqrence to^ I^teted Application 
10 This is a continuatxon-itt-part of U.S. pateot application Serial No. 08/733,230, filed 

October 17. 1996. 

Technical Field 

This invention relates to nucleic acid vaccines for rickettsial diseases of animal$» 
IS including humans. 

Bac?kgrovc4 Of thc InTOltigfl 
Thcricicettsias are a ffcuxp of small bacieriaconmonly transnutted t^axt^^ 
. to man and animais. in which they may cause serious disease. The pathogens causmg human 
.20 rickettsial diseases inchide the agent of epidemic ^pbus. XickeUsia pr0)nfa2ddi^ which has 
resulted in the daaths of milUons of pe^^led^^ The causatiye . 

agents of spotted fever. e.g., RidxUsia rickeasn mi Rkkeil^ are also included within 
thisgnnq). Recently, new types bfhtimaA rickettsial disease caused by mem^ 
£Ar//cftfa9 have been described. £AWicAiae Meet leukoQrtes and endothelial cells of many 
25 dtfrexentinanmuilianqpaeieSy some offli^ 

Over 400 cases of human ehrhch^sis. inebidmg some fatalitipyi, caused by Ehriichia duffeeasis 
have now been reported. CHmiral signs of humttehilidiipds are similar 
Mountain spotted fever, including fever, nausea, vomiting, headache, and rash, 

Heaitwater is another infeetious disease caused by a rickettsial pathogen, namely 
30 Cowdriarwninmtium^zodiitm^^ The disease occurs 

dvmighout roost of Africa and has an estimated endemie area of about 5 milUon square miles. 
In endemic areas, hCTrtwater is a latent infcctian in indigenous breeds of cattle ihat have been 
subjected to centuries of natmal selection. The problems occur where the disease contacts 
suscepriblecB' naive cattle and cither nnninants. Heartwaterhasbeenconiimicdtobconthc 
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island ofGuaddoupc in the Qribbearifeid is qs^^ Tlietick 
vectors icsponsible for s^MTcading this disease arc already piesent on tbc American mainland and 
threaten the livestock indusby in Noifh and South >^ 

In acute cases of hcartwatcr, animals cxhflrit a sudden rise in temperature, signs of 
5 anorexia, cessation of rumination, and nervous symptoms including staggering, muscle 

twitching^ and convulsions. Death usualfy occurs during these convulsions. Pcracute cases of 
the disease occur where the animal collapses and dies in convulsions havmg shown no 
preliminary symptoms. Mortality is high in suscq>tiblc animals. Angcm shcqj infected with 
the disease have a 90% mortality rate while susceptible catde strains have up to a 60% mortality 
10 rate. 

If detected early, tetracycline or chloramphenicol treatment are effective against 
rickettsial infections, but symptoms are similar to munerous other infections and there ate no 
satisfactory diagnostic tests (Hehnick, C, JL Bernard, L. lyAngelo [1984] J. InfecL Dis. 
150:480), 

IS Animals which have recovered from heartwaterare resistant to fiirther homologous, and 

in some cases heterologous, strain challenge. It has shrnlarly been found fliat persons recovering 
from a rickettsial infccdcm may develop a solid and lasting immunit/. Individuals recovered 
fiorn natural infections are often immune to multiple isolates and even species. For example, 
guinea pigs immunized with a recombinant JZ. conorii protein were partially protected even 

20 'against A rtiAeitsU ^ishwanath, G. McDonald, N. Watkms [19901 Infect Inumm. 58:646). 

- - - Itis kiiovmthatOKrei^ 

isolates. Huis. a fiinctianal recombiiiant vaccine against multiple i«>lateg would need to contain - 
multiple epitopes. e.g,^ protective T and B cell qiitqpcs, shared between isolates. It is believed 
that serum antibodies do not play a significant role in the mechanism of immunity agamst 

2S riekett^ (Uiienbei^ G. [\m\Admices m Vet Sd. and Camp, ik/ei/. 27:427-480; Du Flessis, 
Plcssis, Ji. [1970] Ondentqhort J. VeL Jte. 37(3):147-150). 

Vaccines based on inactivated or attenuated rickettsiae have been developed against 
certain rickettsial diseases, for exainple against ilprinM^^ However, these 

vaccines havemajor poblems or disadvantages, including ondesireble toxic reactions, diffictdty 

30 in standardisation, and e^xose (Woodward, T. [1981] "Kiekettsial diseases: cer^m unsettled 

problems in their historical pexspective»'*foiUdi^^ Burgdorfer 
and R. Anackcr; eds^ Academic Press, New Yor]^ pp. 17«40). 

A vaccine currently used in the codrol of hcartwatcr is composed of live irif eeled sheep 
blood. This vaccine also has several disadvantages. First, expertise is rt:quired for dte 
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intravenous inoculati n techniques xtquixcd to administer this vaccine. Second, vaocinatcS 
animals may expcricaice shock and so require daily monitoring for a period ailcr vaccination. 
There is a possibility of deaOi due to shock throughout this monitoring period, and the drugs 
needed to treat any shock induced by vaocinaticm are costly. Third, blood-bome parasites may 
5 be present in the blood vapcinc and be trannnitied to the vaccinates FinaU^, the blood vaccine 
requires a oold chain to preserve the vaccine. 

Clearly, a safer, more efifective vaccine that is easily admintstered would be particidarly 
advantageous. For these icasoos. and with the advent of new methods in biotechnology, 
investigators have concenizated recent^ on the development of new types of vaccines^ including 

10 icmcnbinantvaednea. However, recombinant vaoci^ 

presented to the inunune system such that fihaied epitopes are recognized. These factars have 
contributed to the search for effective vaccines. 

A protective vaccine against ridcettsise that dicits a complete immune response can be 
advantageous. A few antigens which potentially can be useful as vaccines have now been 

IS identified and sequenced for various padiogcoic rickcttsia. The genes encoding the antigens and 
that can be employed to lecombinantly produce those antigen have also been identified and 
sequenced. Certain protective antigens identified fi>r It rickeilsih conorii, and JL prowa2ekii 
{^g,^ lOnq^A and tOn^) are large ^100 kDa), dependent on retention of native conformation 
for protective efficacy, but are often degraded when produced in recombinant systems. This 

20 presents technical and quality-control problems if purified recombinant proteins are to be 

~ ~ included in a vacdne. Themode of presentation of a recombinant antigen to the immune system 

can also be an important &ctor in ^ iimnune response. 

Nucldc add vacdnation has been ^u>wn to induce protective imrnune re^ 
viral systems arid in diverse aniind spcdes (Special Qmfetence Issi^ 

25 acid vacdnes [1994] Vaccine 12:1491). Nucleic acid vaccination, has induced cytotoxic 
lymphocyte (CIL), T-helper 1, and antibody responses, and has been shown to be protective 
against disease (Uhner, J. Dcmclly, S. Parker ei al, [1993] Science 259;1745). For example, 
direct imramuscular injection of mice withDNA encoding the influenza nucleoprotein caused 
the producticoi of high titer antibodies, nudcoprotein^^iediic CTLs, and protection against viral 

30 challenge. Immunization of mice with plasmid DNA encoding the Plasmodium yoelii 

circumsporozoite fsutein induced high antibody titers against mabna sporozoites and CTLs, and 
protection against ohallcfnge infection (Scdegah, VL Iledstxom, P. Hobart, S. Hof&nan [I9SM] 
JProc, NatL Aceid. ScL USA 91:98(96). Cattle immunized with plasmids encoding bovme 
herpesvirus I (BHV-1) glycoprotein IV developed neutralizing antibody and were paitiatfy 
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iShitccted (Cox, G., T. Zamb. L Babiuk [1993J / Wnrf. 67:5664). However, it has been a 
quesdon in the field of immunization vfht&cr the teccnlly discovered technology of nucleic acid 
vaccines can provide improved protection against an antigenic drift variant. Moreover, it has 
not heretofore been recognized or suggested that nucleic acid vaccines may be successful to 
S protect against rickettsial disease or that a major surface protda conserved in xickettsia was 
protective against disease. 

grief ginntnary ftg TuYCTltion 
Disclosed and claimed here are novel vaccines for conferring immunity to rickcttsia 
infection, including Cowdria ruminantium causmg heartwater. Also disclosed aze novel imcldc 
acid composi tions and mefiiods of using those compositions, including to confer immunity in 
a susceptible host- Also disclosed are novel materials and methods for diagnosing infcctiozis by 
Ehrlichia in humans or anitnals* 

One aspect of the subject invention concons a nucleic acid, e.^., DNA or mRNA, 
vaccine containing the major antigenic protein 1 gene (MAPI) ct ±t major antigoiic protein 
2 gene (MAP2} of rickettsial pathogens. In one embodiment^ the nucleic acid vaccines can be 
driven by the faurnan c^ytomegalovims (HCMV) cxihanccr-prDmoter. bi studies iitmiuniaang mice 
hy intramuscular injection of a DNA vaccine composition acoordiiig to the subject invention, 
iinonmissed mice seiocoiwertcd and reacted with MAPI in atitigen blots. Splenocyies from 
immunized mice, but not from control mice immunized with vector only, prolifieratecf in 
re^onsetoxecomibinantMAPI aiidrickettsiai aritigens in i/iv/rre^ lymphocyte pro 
In experiments testing dtiferent DNA vaceine dose tegimens« increased survival rates as 
compared to controls were observed on challenge with rxekcttsia. Accordingly, the stibject 
invention concerns the diseoivery flut DNA vaccines can induce protective immunity against 
rickettsial disease or death resulting tinefn^ 

Brief Descritrtton of the Dniwm^ 
FIgnres lA-lC show a comparison of the amino acid sequences ham alignment of tiie 
ftreerickeosial proteins, nunely. Cdvydria nmUnantbm {CrJ),JShrttchia dioffeensis (i£c.), and 
30 Anctplasmame^ghieJe(A,nL), 

Figures 2A-2C shows the DNA sequence of the 28 UDa gene locus cloned from £ 
cAq^eeftfts(Fig.2A-2B)and£. caRirCFxg.2C). One letter amino acid codes for the deduced 
protein sequences are presented below the nucleotide sequence. The proposed sigma-TO-like 
piomoter sequences (38) are presented in bold andunderlmed tact as -10 and -35 (ccmsensua -35 



15 



20 
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and -10 sequences are TTQACA and TATAAT» resjii^tively). Similarly, consensus ribosomal 
binding sites and transcription terminator sequences (bold letta sequence) aie identified. G-nch 
regions identified in the £ chaffeensis sequence arc undalined. The conserved sequences fiom 
wittiixi tlie coding regions selected for RT-FCR assay are identified with italics and underlined 
S text. 

Figure 3AdK)ws the complete seqi^noe of the MAF2ho^ The 
arrow (-^) represents the predicted start of the mature protein. The asterisk (*) represents tfie 
stcq> codoxu Underlined nucleotides to the open reading fi:ame with -35 and -10 below 
lepresent predicted promoter sequences. Double underlined nucleotides rqresent the predicted 
10 ribosomal binding siie. Underlined nucleotides 3' to the open reading fi^me represent possible 
transcription tcrminatian sequences. 

Figure 3B shows the complete sequence of the MAP2 homolog of Ehrlichia dicffeensis- 
The anxyw (-^) represents the predicted start of the mature protein. The asterisk (^) represents 
the stop codon. Underlined nucleotides 5' to tiie open reading fiam^ with -33 and -10 below 
IS represent {uedicted promoter sequences. Double underlined micleotides represent the predicted 

ribosomai binding site. Underlined nucleotides 3* to the open reading teme represent possible 
^ transcription tcmunation sequences. 

Brief Pescriptiffn of the Sequences 
20 SEQ1D NO. 1 is the coding sequence of the MAPI gene fi;om Cowdria ruminantium 

' ' ' (Highway isolate). . . 

SEQ ID NO. 1 is the polypeptide encoded by the polynucleotide of SEQ ID NO. 1 . 
SKQ HI NO. 3 is the coding sequence of the MAPI gene l^om Ehrtichia chaffeensis. 
SEQ ID NO. 4 is the polypeptide encoded by the polynucleotide of SEQ ID NO. 3. 
2S SEQ ID N0« S is the Anaptasma maf;gf/ia/e MSP4 gene coding.sequenoe. 

SEQ ID NO* 6 is the polypeptide encoded (he polynucleotide of SEQ ID NO. S. 
SEQ ID NO. 7 is a partial codmg sequence of the VSAl gene from Ehrlichia 
chcffeensis^ also diown in Figures 2A-2B. 

SEQ ID NO. 8 is the coding sequence of the VSA2 gene fiom Ehrlichia chaffeensis. 
30 also shown in Figures 2A-2B. 

SEQ ID NO. 9 is the coding sequence of the VSA3 gene from EMiehia chaffeensis, 
also shown in Figures 2A'-2B. 

SEQ ID NO. 10 is the codmg sequence of the VSA4 gene bom Ehrlichia chaffeensis^ 
als shown in Figures 2A-2B. 
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SEQ ID NO. 11 is a partial coding sequence of the VSA5 gcae bom EhriicHla 
chajfeensis, also shown in Figures 2A-2B, 

SEQ ID NO. 12 is the codmg sequence of the VSAl gene fiom Ehrlichia canis, also 
shown in Flguie2C 

S SEQ m NO. 13 is a paitial coding sequence oftheVSA2 gene from 

also shown in Figure 2C 

SEQ ID NO. 14 is the polypeptide encoded by the polynucleotide of SEQ ID NO. 7, 
also shown in Figures 2 A-2B. 

SEQ ID NO. 15 is the polypeptide encoded by the polynucleotide of SEQ ID NO. 8, 
10 also shown in Figures ZA-2B. 

SEQ m NO. 16 is the polypqitide encoded by die polynucleotide of SEQ ID NO. 9, 
also shown in Figures 2A-2B. 

SEQ ID NO. 17 is the polypq;ytide encoded by the polynucleotide of SEQ ID NO. 10. 
' also shown m Figures 2A.<>2B. 
13 SEQ m NO. 18 is fhc polypeptide cmK^ by the poiynuc^^^ 11« 

also shown in Figures 2A-2B, 

SEQ ID Na 1? is the polypeptide encoded by the polynucleotide of SHQ ID NO, 12» 
also shown In Figura 2C. 

SEQ ID rid. 20 is the polypeptide encoded by the polynucleotide of SEQ ID NO. 13, 
20 also shown in Figure 2C. * 

- - . . SEQ ID NO» 21 is the coding sequence of die MAP2geiie from £Aif^^ 
shown in Figure 3A. " — - 

SEQ ID NO. 22 is the coding sequence of die MAP2 gene fircnn Ehrlichia chaffeatsis^ 
also shown in Figure 3B. 
25 SEQ ID NOl 23 is Oie polypeptide en^ 

also shown in Figure 3A. 

SEQ ID NO. 24 is Oie polypeptide encoded by the polynucleotide of SEQ ID NO. 22, 
also diown in Figure 3B. 

30 Detailed Discloswe of the Invenrion 

In one embodinient, fl)e subject invention concerns 
vaccination, for diciting an immune le &pons e pix)tective against rickettsial disease. The subject 
invention also conccms n vel oonqioidtions fiiat can be enoployod acceding to diis novd 
strategy for eliciting a protective inunune response. According to tiie subjca invention. 
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recox&binanl plasmid DNA or iziRNA encoding an antigen of interest is inoculated directly into 
the human or animal host where the antigen is expressed and an immune response induced 
Advantageously, problems of protein puriGcationy as can be encountered with antigen delivexy 
using live vectors, can be viituaUy eliminated by employing the conqKisitions or methods 
5 accordmg to the subject invention. Unlike live vector deUveiy, die subject invention can provide 
afurdier advantage in Aat the DNA or BNA does iioftzep^^ 

vrith gene e]qiressi<m directed for as long as 19 m(mths or naoiepost^ Sec, for example, 

Wolff, JA, JJ. Ludike, G. Acsadi, P. Williams. A. Jani (1992) Hum. MoL Genet. 1:363. A , 
complete imnume response can be obtained as recombinant antigen is synthesized intracellularly 
10 and piesented to the host immune system in the context of autologous class I and class TL ISBC 
molecules. 

In one embodiment, the subject invention concerns nucleic acids and compositions 
comprising (hose nucleic acids that can be effective m protecting an animal fiom disease or 
deafli caused by rickettsia. For example, a nucldc acid vaccme of flie subject inveidon has been 

IS shown to be protective against Cowdria runiinantium^ the causative agent of heanwater in 

domestic ruminants. Accordingly, DNA sequences of rickettsial genes, e.g., MAPI or 
homologues thereof^ can be used as nucleic acid vaccines against human and animal rickettsial 
diseases. The MAPI gene used to obtain this protection is also present in other rickettsiae 
including ite^/osma marginale, EhHidda canis^ and in a causative agent of himian ehrlichiosis^ 

20 EhrHehia chaffeertsis (van Vlict, A, K Jongejan, M, van Kleef, van der Zcijst [1994] Infect 
/mmtiii/62:l451). Ihe MAPI gene or aMAPl-likegenecanalso befoul 
spp. MAPl-Uke genes from Ehrlichia chaffeensis and EhHichia canis have now been cloned 
andsequenccd. TheseMAF-l homologs axe also refiened to herein as Variable Surface Antigen 
(VSA) gpncs. 

25 The present invention also ccmcems polynucleotides encoding MAP2 or MAP2 

homologs fiom Ehriickia amis and Ehrlichia chaffeenm. MAP2 polymicleotide sequences of 
the invention can be used as vaccine compositions and in diagnostic assays. Tliepo^ucleotides 
can also be used to produce the MAPZ polypeptides encoded thereby. 

Compositions comprising the subject polynucleotides can include appropriate nucleic 

30 acid vaccine vectors ^dasmids), which are commerdaHy available (e.g.^ Vical« San Diego, CA). 

In addition, the compositicms can include a pharmaceuticaQy acceptable earner, ag., saline. The 
pharmaceutieally acceptable earners are well known in the art and also are commercially 
available. For example^ such acceptable carriers are described in £.W. Maitin*a Remington's 
Pharmaceutical Science, Mack Pubhshing Company^ Easton, PA. 
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The subject invention also concenSk polypeptides encoded by dse subject 
polynucleotidi:s. ^)ecifically exemplified are the polypeptides eucoded by the MAT-I and VSA 
genes of C rumimantlum, K ckqffeeasis, £ canis and the MP4 gene QtAne^la&ma mrginale. 
Pdypqitides unooded by £ du^eensis and £ canis MAP2 genes are also exemptified herein. 

5 

Also enconipassed ivithin the scope of the present invention axe fragments and variants 
of the e9cetnpliiied polynucleotides. Variants include polynucleotides and^or polypeptides 
having base or amino acid additions, deletions and substitutions in the sequence of the subject 
molecule so long as those variants have substantially the same activity or serologic reactivity 

10 as tfic native molecules. Also included are alldie variants of the subject polynucleotides. The 

polypeptides and peptides of the present invention can be used to raise antibodies that are 
reactive with the polypeptides disclosed hccdn. The polypeptides and peptides can also be used 
as molecular weight markers. 

Another aspect of the stAjcct invention concerns antibodies reactive with MAP-I and 

IS MAP2 polypqitides disclosed herem. Antibodies can be monoclonal or polyclonal and can be 

produced using standard techniques known in the ait Antibodies ofihc invention can be used 
in diagnostic and therapeutic s^lications. 

In a specific embodiment, the subject invention concerns a DNA vaccine (e*£.» 
VCLIOIQ/MAPI) containing the msLjoc antigenic protein 1 gene (MAPI) driven by the human 

20 cytomegalovirus. (HCMV) enhancer-promoter injected intramuscularly into 8-10 week-old 

female DBA/i mice after seating them wift 50 pl/muscle of 0^% bupivacaine 3 days 
previously. Up to 75% of the VCLlOlO/MAPl-iramunized mice seroconvehed and reacted with- 
MAPI in antigen blots. Splenocytes from immunized mice^ but not from ccmtrol mice 
immunized wifliVCXlOIO DNA (pksmid vector, Vical, San Diego) proliferated in response to 

25 recombinant MAPI and C rumirumtmm antigens in in virro lymphocyie proliferation tests. 

These proliferating cells from mice immunized with VCLIOIO/MAPI DNA secreted IFN- 
gamma and JL-Z at concentzations ranging from 610 pg^l and 152 pgM to 1290 pg/iml and 310 
pg/ml, respoctiYcly. In experiments testing different VCLIOIO/MAPI DNA vaccine dose 
regimens (2S-I00 ^g/dose, 2 or 4 immimizations). survival rates of 23% to 88% (35/92 

30 survivors/total in all VCLIOIO/MAPI immunized groups) were observed on challenge with 

30LD50 of C. nmimmtium. Survival rates of 0% to 3% (1/144 survivors/total in all omtrol 
groups) were recorded for control mice immunized similarly with VCLlOlO DNA or saline. 
AccordiAgly» die subject inventiott eoucems flie diacovety that the gene encoding the MAPI 
protein can induce protective immunity as a DNA vaccine against rickettsial disease. 
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Hie nucleic acid sequences described herein have o^er uses as well. Fo? example^ the 
nucleic adds of the subject inventicm can be useful as probes to identify conqslementaxy 
sequences within other nucleic add molecules or genomes. Such use of probes can be applied 

to identify or dtstinguish infisctjous stmina of fflrganiffljl in ngnffft'c pmrj^tiw; nr in TTcTogr fferf fll 

5 rescazch where idcntificatifmofpaiticularQrgamsms or strain As is well known in 

the fiiti probes can be made by labeling the nucleic acid sequences of interest according to 
accepted nucldc acid labeling pracedures and techniques. A person of ordinaiy skill in the art 
would recognize that variations or fragments of the disclosed sequences which can qsedfically 
and selectively l^hiidtze to the DNA of rickettsia can also fimction as a probe. It is within the 
10 ordinaxy skill ofpersons in the an, and does iu)t require undue experimentation in view of 

description provided hcrdn, to determine whether a segment of the claimed DNA sequences is 
a fragment or variant which has characteristics of the full sequence. e.g., whether it specifically 
and selectively hybridizes or can confer protection against rictccttsial infection in accordance 
with the subject invention. In addition, widi the benefit of the subject disclosure describing the 
IS , specific sequences, it is within the ordinaiy ddU of those persons in the art to label hybridizing 
sequences to produce a probe. 

It is also well known in tixe art that restriction enzymes can he used to obtain fimctiona] 
fragments of the subject DNA sequences. For example, BaBl exonuclease can be conveniently 
used for time-controlled limited digestion of DNA (commonly lefenied to as ''erase-a-base** 
20 procedures)* See, example, h^iatise/dil(19S2)A/o/ea(/arC/c^ 

- Cold Spring Harbor Laboratojy, New YoriqWd^rai (1983) J. £/o/. CAent. 258:13006-13512. 

In addition, dienudeic acid sequences ofthe subject invention can be - 
wdght markers in nucldc acid analysis proceduxes. 

25 Following are exanq}ieswUch i]hi8tzate pioccduties£^ 

eiuunples should not be construed as limiting. All percentages are by weight and all solvent 
mixture proportions arc by volume imless otherwise noted. 

Example 1 

30 A nucldc acid vaccine consUuct was tested in animals for its ability to protect against 

death caused by in&ction with the rickettsUCmdh^ The vaccine construct tested 

was the MAP 1 gene of C. rundnantium inserted into plasmid V(X1010 (Vical, San Diego) under 
control of the human ^ftomegalovinis promoter-enhancer and intnm A. In this study, seven 
groups containing 10 xnice each were itgeeted twice at 2^week intervals with either 100, 75, 50, 
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r25figVCX10lO/MAPl DNA(V/MinTablc 1 below),orl00,50|igVCL1010DNA(Vin' 
Table 1) or saline (Sal.), respectively. Tw weeks after fhe last injections, 8 mice/group were 
challenged with 30Lt>50 of C rumbumtivm and clinical i^ptoms and survival sionitoiied. Hie 
remainin g 2 mice/group were not challenged and were used for lymphocyte proliferation tests 
5 and cytokine measurements. Tberesdts of the stntfy are summarized in Tabic t,bdow: 



Table! 




100 pg 
V/M 


75 pg 
V/M 


SOpg 
V/M 


25 pg 
V/M 


100 PS 
V 


50 pg 
V 


Sal. 


Survived 


5 


7 


5 


3 


0 


0 


0 


Died 


3 


1 . 


3 


5 


8 


8 


8 



the VCLIOIQ/MAPI nuelde acid vaccine increased survival on challenge in all groups, with 
a told of 20/30 mice surviving compaved to 0/2^ 
1 5 This stnd/ was repeated with aiiodier 6 groups^ eadi containbg 3 3 miee (a total o f 19 S ' 

mice). Three groups received 75 |ig VCLIOIO/MAPI ONA or VCLlOlO DMA or saline (4 
injections in all cases). Two wedcsafier the last injection. 30 mice/gro^ 
30LDS0 of C rumimmiium and 3 mice/jgroiip were saoificed Soc ^rmpbocyte proliferation tests 
imd cytokine measuiienients. The lesidts ofthis study are simmiarteed in Table 2, bdow 

20 

■ " Table 2 

V/M2ini. V2inj. SaL2inj, V/M4inj. V4inj. Sal.4inj. 

Survived 7 0 0 8 0 1 

25 

*Itt mice ftat died in both V/M groq)s» there was an increase in mean survival time of 
qipnudniatcfy 4 days compared to the ccmtrok (p<0.0S). 

Again, as summarized tn Table 2, the VCLIOIO/MAPI DNA vaccine uicreased the 
3 0 numbers of mice surviving in both immunized groupst although ihm was no apparent benefit 
of 2 additional injections. In these two experiments»fliere were a cumulative total of 35/^ 
(38%)$undvingmiceingro«^receivii«theVCL101Q/MAPlDNAya» 1/144 
(0.7%) survivir^ mice in the control groups. In bodi imnumization and challenge oials 
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described above, sp&nocytes from VCLIOIQ/MAPI immunized mice, but not from control 
mice, speciflcaUy proliferated to recombinant MAPI protein and to C ruminemxium in 
lymphocyte proliferation tests. These proliferating splenocytes secreted JLrl and gamma' 
interferon at concentrations up to 310 and 1290 pgtal respectively. These data show that 
S protection against rickettsial infections can be achieved withaDNA vaccine. In additicm» ttese 
experiments show MAPI-related proteins as vaccine targets. 

Example! 

The MAPI protein of C ruminantium has significant similarity to MSP4 of A. 
inOTjina^e, mid related molecules nmy also be pre&erxting To prove 

this, we used primers based on r^ionscanserved between C namaaii/itfmand>L jno/^guia/ein 
PCR to clone a MAPl-Iilffi gene fiom £1 cAq^^eitrir. The amino acid sequence derived frcHn 
the cloned E, ehqffeensis MAPI-like gene, and alignment with the corresponding genes of C 
nim/minritim and >1 m^iiiate is shown in Kgore L We have now identified the regicms of 
MAPl-Iilce genes ivhich are higihly conserved between £%r/icftui, Cawdria^ and Anaplasma and 
which can allow cloning of the analogous geties fiom other ridcettsiae. 

Example 3 -> Cloning and sequence analysis of MAPI homologue genes of £. cha/feensis and 
E. canis 

20 Geneshosnol0Si>u5totfaenisgorair&oepiotNnof C rumuumtiumTAKiX were cloned 

fiom £1 dhajfeensis and £ conis by using PGR cloniDig stiat^es. The cloned segments 
icprtsent a 4.6 Id) genomic locus of £dbi^^ DNAsequence 
generated fiom these clones was assembled and is presented alor% with the deduced amino acid 
sequence in Figures 2A-2B (SEQ IDNOs. 7-11 and 14-18) and Figure 2C (SEQ IDNOs. 12-13 

2S and 19*20). Signiiicant fixtures of dze BNAinch^ 

reading fiames (OSFs) for £1 c&i^^faQtmand two veiy ainilar, nonidentical ORFs fiir the£ 
i^mu ekmcd loous. TlieOSFs for bodi£Mi^Ate$pp« are separated by ix^ 
ranging from 264 to 310 base pairs. The noncoding sequences have a higher A+T content 
(71.6% for £1 dtaffeensis and 76.1% fisr R canis) than do fbe coding sequences (63.5% for E. 

30 duffeensis and 68.0% for £ otnur). A G-iich region -200 bases upstream fiom tte initiation 
codcn, $igmaF70-like pnmioter sequences^ putative xibosome binding sites (RBS)» termination 
codons, and palindromic sequences nearihe termination codons are found in each of tiie £ 
diajfeemis noncodiqg scquftncfts. The £cantf noncoding sequence has 4ie same feature eiccept 
telho G-rich region (Figure 2C; SEQ ID NOs. 12..13 and 19-20). 



J 



10 



IS 
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Sequence eomparisons of tlie ORFs at flic nucleotide and traifilatcd amino add levels 
revealed a high degree of similarity bctwctu dietn. Hie siniilarity spanned the entiie coding 
sequences, except in three regions where notable sequence variatioos were observed including 
fiome deletions/insertions (Variable Regions I, Hand Despite ^similarities, no two QRFs 
S ' are identical. The cloned ORP 2, 3 and 4 of ^. choffeensis have complete coding sequences. 
The ORF 1 is a partial gene having only 143 amino aci& at Oe C-tenninus whei«aa the ORF5 
is nearly complete but lacks 5-7 anuzio adds and a tenninati Hie cloned 0RF2 of £ 

coiuf also is a partial gene lacking a pait of the C-terminaIs6^ Tbe oveiall similarily 
between dificrent QRFs at the amino acid levd is 56.0% to 85.4% for £. ckaffeeasis^ whereas 

10 &rKcanisiti&S33%. The similarity of £ c&jj^feeiiwORPs to the MA^ 

reported for C rumimmtium isolates ranged fiom SS.5% to 66.7%. while for £ cams to C 
nmiinantium it is 48^% to S4^%« Due to 'Sieir high degree of similazxty to MAPI surface 
antigen genes of C namnmibm and since they are nonideotical to each oiher» the£ duffeensis 
and £oaiaiyaRFs a r e i db ic d to hcre&i as putative Vari^^ The 

15 apparent molecular masses of tfie predicted matute proteins of £ duiffeensis were 2S.75 kDa 
fbr VSA2. 27 J8 for VSA3, and 2755 for VSA4, while £ caaisf VSAl was slighily higher at 
29.03 Wb. The first 25 amino acids in each VSA coding sequence were diminated when 
calculating the protein size since they maxfccdiy resembled the signal sequence Of C 
mminantiumldAFl andpresumaUy wouldbeabsentfiomttematin Predktedpvotem 

20 s]ze$far£ chqffeensis VSAl ^iy&AS. mi R 
- - conq^lete genes w|?e not cloned. 

It should be imdersiDoddut the exam|des and embodunentsdesG^ 
illustKattve puiposes only and that various modifications or dmges in Ught thereof win be 
. 25 suggested to perscmB skilled in die art and are to be induded wilUn the spirit and purview of (his 
appUcatian and ftie scope of the appended claims. 
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' SEQUENCE LISTING 



(1) GENG3WC INFORMATION: 



{i) APPIiXCANT: 



Applicant Name (a): 
Street Address s 



university of Florida 

223 Grinter Hall 

Gainesville 

Florida 

US 

32611 

(352) 3S2-892d 



City: 



State/Province : 



Country: 



postal dode/2lp: 



Phone number: 



Pax: (352) 392^€SC0 



(ii) TITXiE OF INVENTION: NUcleic Acld vaccines Against 
ILlcketCslal Diseasss and Methods o£ Use 

(111) NUNBBfi OF SEQUENCBS: 24 

(iv) COERBSPOWENCB ADDRESS: 

(A) ADDRESSEE: Saliwanchik, Lloyd & Saliwancshik 
(E) STREET: 2421 N.W. 4X8t Street, Suite A- 1 
(C] CITY: Gainesville 

(P), STATS: FL 

(E) COUNl^Y: USA 

(F) ZIP: 32606 

(V) GOHPUTSR RSAIIABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC coB^atlble 

(C) OPERATINO SYSTEM; PC-D0S/H5-D0S 

(D) SOFTWARE: Patent In Release #1.0, Version #1.^0 



(vi) -CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER:' PCT ~ - " 

(B) FILING PATS: 17. October 1997 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME; Pace, Doran H 

(B) REGISTRATION NUMBER: 36,261 

(C) REFERENCB/DOCXET NUMBER t UP-367C1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 352-375-8100 

(B) TELEFAX: 352^372-5800 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) I.ENCTU: 864 baee pairs 
(5) TYPE: nucleic acid 

(C) S11U\NDB0NES6 : single 

(D) TOPOLOGY : linear 
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(11} mX^BCOhS TYPIS; DMA (genomic) 

(ix) fBATURE: 

{A) HAME/XEY: CDS 
(B) LOCAXlOSr: 1..8G1 

(xi) SEQUEKCS DBSCRXPTZOK: 5£Q ID KQ:!: 

ATC5 MX TGC AAG AAA ATT TTT ATC ACA |«3T ACA CTA ATA TCA TTA OIO 48 
JSftt Afln Cyn Lys Lys He Phe He Thr Ser Thr Lau 11b 6er Leu Val 
1 5 10 15 

TCA TTT TTA CCT GGT GTG TCC TTT TCT GAT <3TA ATA CAO OAA (SAC ABC 36 
Ser Phe Leu Pro Gly Val Ser Phe Ser Asp val He Gin alu Asp ser 
20 35 30 

AAC CCK GCA <96C AGT GTT TAC ATT ABC GCA AAA TAC AtG OCA ACTT GCA 144 
Asn Pro Ala Gly fier Val Tyr He fier Ala Lys nyr Met Pro Thr Ala 
35 40 45 

^ TCA CAT TTT GOT AAA ATG TCA ATC AAA GAA GAT TCA AAA AAT ACT CAA 192 
' Ser His Phe Gly Lys Met Ser lie Lys Glu Asp Ser Asn Thr Gin 
5<) 55 . £0 . . 

ACG GTA TTT GGT CTA AAA AAA GAT TCG GAT GGC GTT AAA ACA GCA TCA 240 
Thr Val Phe Gly Leu Lys Lys Asp Tzp Asp Gly Val Lys Thr Pro Ser 
65 70 75 80 

GAT TCT ABC AAT ACT AAT TCT ACA ATT TTT ACT GAA AAA GAC TAT TCT 206 
Aap Ser ser Asn Thr Asn Ser Thr lie Phe Thr Glu Lys Asp Tyr Ser 
85 90 '95 

TtC AGA TAT GAA AAC AAT CCG TTT TTA GGT TTC CrCT GGA GCA ATT GGG 336 
. Phe Arg Ty^ Glu Asn A&n Pro Phe Leu Gly Phe Ala Gly Ala He Gly 

100 " - - 105 . llO 

TAC TCA ATG AAT GGA CCA AGA ATA GAG TTC GAA GTA TCC TAT GAA ACT 384 

Tyr ser Met Asn Gly Pro Arg He Glu Phe Glu Vai Ser Tyr Glu Thr 

115 120 125 

TTT OAT GTA AAA AAC CTA C>GT GGC AAC TAT AAA AAC AAC GCA CAC ATG . 432, 

Phe Asp Val Lye Asn Leii Gly Gly Ash Tyr Lys Asn Asn Ala His Net 
130 135 140 

TAC TGT GCT TTA GAT ACA GCA GCA CAA AAT A6C ACT AAT GGC GCA GGA 480 
Tyr Cys Ala Leu Asp Thr Ala Ala Gin Asn Ser Thr Asn Gly Ala Gly 

145 ISO 155 160 

TTA ACT ACA TCT GTT ATG GTA AAA AAC GAA AAT TTA ACA AAT ATA TCA 526 
Leu Thr thr Ser val Met Val Lys Asn Glu Asn Leu Thr Asn He Ser 
165 170 175 



TEA ATG TTA AAT GCG TGT TAT GAT ATC ATG CTT GAT GGA ATA CCA OTT 
Leu Met X*eu Asa. Ala Cyo Tyr Aop lie Met Leu Aap OXy lie Pro Val 
180 185 190 



576 
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TCT CCA TAT OTA TOT OCA GGT ATX QGC ACT GAC TTA GTG TCA CTA AXS 624 
Ser Pro Tyr val Cya Ala Oly lie Cly Thr Asp Lev Val Ser Val lie 
19S 200 2Q5 

AAT GCr ACA AAT CCT AAA TTA TCT TAT CAA GGA AAG CTA GGC ATA AQT 672 
Aan Ala Thr Asn Pro Lys Leu Scr Tyr Gla Oly lys Leu Gly lie Ser 
2X0 215 220 

TAC TCA ATC AAT TCT GAA GCT TCT ATC TTT ATC GGT GGA CAT TTC CAT 720 
Tyr Ser lie Afin ser Glu Ala ser lie Pke lie Oly Gly Hie Phe Hie 

225 230 23S 240 

AfiA GTT ATA GGT AAT GAA TTT AAA GAT ATT GCT ACC TTA AAA ATA TTT 768 
Arg Val lie Gly Asn Olu Phe Lys Asp lie Ala Thr Leu Lys lie Phe 
245 250 25S 

ACT TCA AAA ACA GGA ATA TCT AAT GCT GGC TTT GCA TCA 6CA ACA CTT Bl€ 
Thr Ser U/b Thr Gly lie Ser Asn Pro Qly Phe Ala Ser Ala Thr Leu 
260 2€S 270 

GAT GTT T6T CAC TTT GGT ATA GAA ATT GCA GGA AGG TTT GTA TTT 061 
Asp Val Cya His Phe Oly He Glu He Gly Gly Arg Phe Val Phe 
275 280 285 

TAA B64 

(2) INFORKATlbHf FOR 8EQ ID N0:2: 

(i) SEQUBKCE CHARACTS&IfiTXCS: 

(A) LESiGTHs 287 amino acids 

(B) TYPE: amino add • 
(D) TOPOLOGY: linear 



— (ii)- MOLECULE TYPE;- protein. . . _ ^ ^ 

(xi) SEQX7EKCB DBSCRXPTION: SEQ ID firdi2: 

Met Asn Cyc Ly9 Lyo He Phe He thr Ser Thr Leu He Ger Leu Val 
_ 1 _ S 10 15 

Ser Phe Leii Pro Gly Val Ser Phe Ser Asp Val lie Gin Glu Asp Ser 
20 25 30 

Asn Pro Ala Gly Ocr Val Tyr He Ser Ala Lys Tyr Met Pro Thr Ala 
35 40 45 

Ser His Phe Gly Lys Met ser He Lyo Glu Asp Ser Lys Asn Thr Gin 
50 55 60 

Thr Val Phe Gly Leu Lys Lye Asp ttp Asp Gly Val Lys Thr Pro Ser 
« 70 75 80 



Asp Ser ser Asn Thr Ann Ser Thr He Phe Thr Glu Lys Adp Tyr Ser 
85 90 95 
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Phe Arg lyr Glu Asa Aan Pro Phe Leu Gly Phe Ala Gly Ala He Gly 
100 105 110 

Tyr ser Met Asn Gly Pro Arg He Glu Phe Glu val Ser lyr Glu Thr 

lis 120 125 

Phe Asp Val Lys Asn Leu Gly Gly Aen lyr Lys Asn Asn Ala His Ket 

130 135 140 

Tyr cys Ala Leu Asp Thr Ala Ala Gin Asn Ser Thr Asn Gly Ala Gly 
145 ISO 155 

Leu thr Thr Ser Val Met Val Lys Asn Glu Asn Leu Thr Aen He 6er 
165 170 175 

Leu Met Leu Asn Ala Cys Tyr Asp He Met Leu Asp Gly He Pro Val 
180 185 IBO 

Ser pro Tyr Val Cys Ala Gly Zle Gly Thr Asp Leu Val Ser val He 
195 200 205 

Asn Ala Thr Asn Pro Lys Leu Ser Tyr Gin Gly Lys Leu Gly lis Ser 
210 215 220 

Tyr Ser lie Aon Ser Glu Ala ser He Phe lie Gly Gly Bie Phe His 

225 230 235 240 

Arg Val He Gly Asn Glu Phe Lys Asp He Ala Thr Leu Lys He Pha 
245 250 2S5 

Thr Ser Lys tAlt Gly He Ser Asn Pro Gly Phe Ala ser Ala Thr Imi 
260 265 270 

Aep val eye His Phe Gly He Glu lie Gly Gly Arg Phe val Phe 

- 275 - . - _ . . _ .280^ 285 



(2) INFORMATIOSr POR SEQ ID N0:3: 

(i) SEQtlEUCE CHARACTQIISTICS : 

(A) LEKGTH: 842 base pairs 

(B) TYPE: zmeleic acid 

(C) STRANDED^SS: single 

(D) TOPOLOGy: linear 

(ii) MOLECULE TYPE: QUA (genomic) 

(ix) FEATURE: 

(A) KAME/XST: CDS 

(B) LOCATZONf 1..640 



(xi) SEQUENTCE DESCRIPTIOKs $jEQ ID 130:3: 
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ATG AAT TAC AAA AAA AST «tc ATA ACA GCG ATT GAT AXC ATT AAT ATC 48 
Met Aan Tyr Lys Lya Ser Phe He Thr Ala He Asp He He Asn He 
290 295 300 

CTT CTC TTA CCT GGA GTA TCA TTT TCC GAC CCA AGO CAG GTA GTG CTTC 96 
Xieu Leu Leu Pro Gly Va3. Ser I^he Ser Asp Pro Arg Gin Val Val Val 
305 310 

ATT AAC GGT AAT TTC TAC ATC ACT GGA AAA TAG GAT GCC AA6 GCT TCSG 144 
He Asn Gly Asu Phe Tyr He Ser Gly Lys Tyr Asp Ala Lys Ala Ser 
320 325 330 335 

CAT TTT GGA OTA TTC TCT GCT AAG CAA qAA AGA AAT ACA ACA GTT GQA 192 
His Phe Gly Val PJxe Ser Ala Lys Glu Glu Arg Aan Thr Thr Val Gly 
340 34S 350 

GTG TTT GGA CTG AAG CAA AAT TGG GAC GGA AGC GCA ATA TCC AAC TCC 240 
Val Phe Gly Leu Lys Gin Asa Trp Asp Gly aer Ala He Ser Aon ser 
355 360 3^5 

TCC CCA AAC GAT GTA TTC ACT GTC TCA AAT TAT TCA TTT AAA TAT 6AA 286 
Ser Pm Abxi Asp V«l Phe Thr Val Ser Asn Tyr Ser Phe Lys Tyr Glu 
370 375 380 

AAC AAC CCG TTT TTA GGT TTT GCA GQA OCT ATT GGT TAC TCA ATG QAT 336 

Acn Aen Pro Phe Leu Gly Phe Ala Gly Ala He Gly Tyr Ser Met Asp 
385 390 395 

GGT CCA AGA ATA GAG CTT GAA GTA TCT TAT GAA ACA TTT GAT GTA AAA 384 

Gly Pro Arg lie Glu Leu Glu val ser Tyr Glu Thr Phe Aep val Lye 

400 405 410 415 

AAT CAA GGT AAC AAT TAT AAG AAT GAA CCA CAT AGA TAT TGT GCT CTA 432 
Aan Gin Gly Aan Asn Tyr Lys Asn Glu Ala His Arg Tyr Cy« Ala Leu 
- - - - - ^ .-420_ , _ ^ _ _ 42S 430 

TCC CAT AAC TCA GCA GCA GAC ATG AGT AGT GCA ACT AAT AAT TTT GTC 480 
Ser Hi8 Asn Ser Ala Ala Asp Met Ser Ser Ala Ser Asn Asn Phe Val 
435 440 445 

TTT CTA' AAA AAT GAA GGA TIA- CTT GAC ATA TCA TTT-ATG CTC AAC GCA . 528 

Phe Leu liya Asn Glu Gly Leu Leu Asp He 9er Phe Net Leu Asn Ala 
450 455 460 . 

TGC TAT GAC GTA GTA GGC GAA GGC ATA CCT TTT TCT CCT TAT ATA TGC 576 
Cya Tyr Asp Val Val Gly Glu Gly He Pro Phe Ser Pro lyr He Cys 
465 470 475 



GCA GGT ATC GOT ACT QAT TTA GTA TCC ATG TTT GAA GCT ACA AAT CCT 624 
Ala Gly He Gly Thr Asp Leu Val Ser Met Phe Glix Ala Thr Asn Pro 
480 485 490 495 

AAA ATT TCT TAC CAA GGA AAG TTA GGT TTA AGC TAC TCT ATA AGC CCA 672 
Lye He Ser Tyx Gin Oly Lys Leu Gly Leu Ser Tyr Ber He Ser Pro 
500 SOS 510 
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GAA GCT TCT OTG TTT ATT GGT GGG CAC TTT CRT MjS GTA ATA GGG AAC ^ 720 

Glu Ala sor Val Pha He 6Iy Giy His Phe His Lys Val He Gly Aan 
515 . 520 B25 

GAA TTT AGA QAT ATT CCT ACT ATA ATA CCT ACT GGA TCA ACA CTT GCA 768 
Glu Phe Ar0 Acp He Pro The He He Pxo Thr Gly fier Thr Leu Ala 
530 535 , 540 

GGA AAA GGA AAC TAC CCT GCA AtA GTA ATA CTG GAT GTA TOC CAC TTT 8X6 
Gly Lys Gly Asn Vyr Pro Ala He Val He Leu Asp Val Cyc Hie Phe 

S4S. 550 S55 

GGA ATA CAA AtG GGA GGA ACG TTT AA 842 
Gly He Glu Met Gly Gly Arg Phe 
560 565 



(2) INFOSMATZOzr FOR SEQ ID KO:4r 

(1) SEQUEKfCE CHARACTBRlSTtCS: 

(A) LERGTS; 280 amino acids 

(B) TYPE; amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: pxOCein 

(xi) SBQliEiffCB DESCRIPTIW: SEQ ID 110:4s . 

Met Asn Tyr Lys Lys Ser Phe He Thr Ala He Asp He He Asn He 
15 iO 15 

Leu Leu Leni Pro Gly Vol fier Phe Ser Asp Pro Arg Gin Val Val Val 
20 25 30 

lie Asn Gly Aan Phe- Tyr -He Ser Gly .Lys. tyr Asp Ala^Lys Ala Ser 
35 . 40 45 

His Phe QXy Val Phe Ser Ala Lys Glu Glu Arg Asn Thr Thr Val Gly 
50 S5 SO 

Val Phe Gly Leu Lys Cln Asn Tip Asp Gly Ser Ala He Ser Asn Ser 
65 70 75 80 

ser Pro Asn Asp Val Phe Thr Val Ser Asn Tyr Ser Phe Lys Tyr Glu 
B5 90 95 

Asn Asn ^ro Phe Leu Gly Phe Ala Gly Ala He Gly Tyr Ser Het Asp 

100 105 110 - 

Gly Pro Arg He Glu Leu Glu Val Ser Tyr Glu Thr Phe Asp Val Lys 
lis 120 125 

Asn Gin Gly Asn Asn Tyr Lys Asn Glu Ala His Tyr cys Ala Leu 

130 135 140 



DEC-10-98 THU 03:12 PM HESKA CORPORATION FAX NO. 970 221 1126 



W 98iltSSi l>CI7US97/190i4 

19 

Ser Kis Asn Ser Ala Ala Asp net Ser Ser Ala Ser Asn Aan Phe Val 
145 150 ISS 160 

Phe Len Lys Asn Glu Giy Leu Leu Asp He fier Phe Met lieu Adn Ala 
16S 170 175 

Gys lyr Asp Val Val Gly Glu Gly lie Pro Phe Ser Pro xyr He Cyu 
180 185 ISO 

Ala OXy He Gly Thr Asp Leu val ser Met Phe Glu Ala Thr Asn Pro 
195 200 205 

Lye He Ser Tyr Gin Cly Lye Leu Gly Leu Ser Tyr Ser He Ser Pro 
210 215 220 

Glu Ala Ser Val Phe He Gly Gly Kis Phe His Lys Val He Gly Asn 
225 230 235 240 

Glu Phe Arg Asp He Pro Thr He He Pro Thr Gly Ser Thr Leu Ala 
245 250 255 

Gly Lys Gly Aan Tyr Pro Ala He Val He Leu Asp Val Cys Ris Fhe 
250 265 270 

Gly He Glu Mec Gly Gly Arg Phe 
275 280 



(2) INFOXmATION FOR SBQ ID NO:S: 



(i) SSQUBtlCE CHARACTS&ISTICS s 

(A) tsKGTHs 849 base pairs 

(B) TYPE: nucleic acid 

(C) STRAZIDEDMESS: single 

(D) TOPOLOGys linear 



(ii) KOLBCULE TYPE: UNA (genomic) 

(ix) FBATORE: 

. (A) NAMB/KEY: CZ3S 
(B) LOCATION: 1. .846 

(xi) S2QUSNCE DESCSHPTION: SfiQ ID 210:5: 



ATG AAT TAC AGA Chh TTG TTT ACA GGG G6C CTG TOl GOV GCX! ACA GTC 
Net Asn Tyr Arg Glu Leu Phe Thr Gly Gly Leu Ser Ala Ala Thr Val 
285 290 295 



48 



TGC GCC TGC TCC CTA CTT GTT AGT GOG GCC OTA GTG GCA TCT CCC ATG 
Cys Ala Cys Ser Leu Leu val Ser Gly Ala val Val Ala Ser Pro Met 
300 305 310 



96 



AGT CAC GAA GTG GOT TCT GAA GGG GGA GTA ATG GQA GGT AGC TTT XAC 
Ser His Glii Val Ala Ser Glu Gly Gly Val Met Gly Gly Ser Phe Tyr 
315 320 325 



144 
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GTQ GOT GCG GCC TAC6A0C CCA QCA TTT OCT TCT OTT ACC TOG TTC GAC 152 

val Gly Ala Ala Tyr fier Pro Ala Phe P» s r Val Hir Ser Phe Aap 

330 335 340 

ATG CGT GAG TCA AGC AAA GAG ACC TCA TAC GTT AGA GGC TAT GAC AA$ 240 
Net Arg Glu Ser Ser Lys Glu Thr Ser Tyr Val Arg Gly Tyr Asp hya 
345 350 355 360 

AGO ATT GCA ACQ ATT GAT GTS AGT GTS CCA QCA AAC TXT TCC AAA TCT 2B6 

Ser lie Ala Thr He Asp val ser Val Pro Ala Asn Pbe Ser Lys ser 

365 370 37S 

GGC TAC ACT TTT GCC TTC TCT AAA AAC TTA ATC ACG TCT TTC GAC QQC 336 
Gly Tyr Thr Phe Ala Phe Ser Lys Asn Leu He Thr Ser Phe Asp Gly 
380 385 390 

GCT GTG GGA TAT TCT CTG GQA GGA GGC AGA GTO CAA TTO GAA GCG AGC 384 
Ala Val Gly Tyv Ser lieu Oly oly Ala Arg Val Glu Xieu Olu Ala Ser 
39S 400 405 

TAC AGA AGG TTT GOT ACT TT6 GCG GAC GGG CAG TAC 6CA AAA AGT GGT 432 
Tyr Arg Arg Phe Ala Thr I»eu Ala Aop Gly Gin Tyr Ala hya Ser Gly 
410 415 420 

GCG GAA TCT CTG GCA GCT ATT ACC GGC GAC GCT AAC ATT ACT GAG ACC 480 

Ala Glu Ser lieu Ala Ala He Thr Arg Asp Ala Asn He Thr Glu Thr 

425 430 435 440 

AAT TAC TTC GTA GTC AAA ATT GAT GAA ATC ACA AAC AOC TCA GTC ATG 528 
Asn tyr Phe Val Val Lys He Asp Glu He Thr Asn Thr Ser Val Met 

445 450 455 

TTA AAT GGC TGC TAT GAC GTG CTG CAC ACA GAT TTA CCT GTG TCC CCO 576 
Leu Asn Gly Cys Tyr Asp Val Leu His Thr Asp Leu Pro Val Ser Pro 

460 465 470 . _ _ 

TAT GTA TGT GCC GGG ATA GGC GCA AGC TTT GTT GAC ATC TCT AAG CAA 624 

Tyr Val Cya Ala Gly He Gly Ala Ser Phe Val Asp He Ser Lys Gin 

475 480 485 . 

GTA ACC ACA AAG CTG GCC TAC AGG GGC AAG GTT GGG ATT AGC TAC CAG 672 
Val Thr Thr Lys Leu Ala Tyr Arg Gly Lys Val Gly He Ser Tyr Gin 
490 495 500 

TTT ACT GGG GAA ATA TCC TT6 GTG GCA GGT GGG TTC TAC CAC GGG CIA 720 
Phe Thr Pro Glu He Ser Leu Val Ala Gly Gly Phe Tyr His Gly Leu 
505 510 ^ 515 520 

TTT GAT GAG TCT TAC AAG GAC ATT CCC GCA CAC AAC AGT OTA AAG TTC 768 

Pha Asp Glu Ser Tyr Lys Asp lie Pro Ala His Asn Ser Val Lys Phe 
525 530 535 



TCT GGA GAA GCA AAA GCC TCA GTC AAA GCG CAT ATT GCT GAC TAC GGC 
Ser Gly Glu Ala LyB Ala ser val uys Ala His Xle Ala Aep Ityr Gly 
540 545 550 



816 
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* TTT AAC CTT GOA GCA AOA TTC CTO TTC AGC TAA C 849 

Phe Asn Z«eu Gly Ala Arg Phe Leu Phe Ser 
S55 S60 

(2) XNFORNATION FOR SSQ ID N0:6: 

(i) SEQUE17CB CKAHACTBRISTXCS: 

<A) LENGTH: 2B2 amino acidS 
<B) TYP£: amino acid 
(D) TOPOLOGIT: linear 

(ii) NOLEOHiE tyPH: protein 

(xi) SSQUfiNCB DESCRIPXIOKs SEQ ID HO:£: 

Met Asn Tyr Arg 61u Leu Phe Tlzr Cly Gly i<eu ser Ala Ala Thr Val 
15 10 15 

Cy5 Ala cys Ser Leti Leu val fier Qly Ala Val Val Ala Ser Pro Met 
20 25 30 

Ser His Glu val Ala Ser 6lu Gly Gly Val Net Gly Gly Ser Phe Tyr 
35 40 45 

val Gly Ala Ala xyr Ser Pro Ala Phe pro Ser Val Thr Ser Phe Asp 
50 55 $0 

Net Arg Glu Ser Ser Lys Glu Thr Ser Tyr Val Arg Gly Tyr Aep Lye 
65 70 75 80 

Ser lie Ala Thr lie Asp Va]* Ser Val Pro Ala Asn Phe Ser Lys Ser 
85 90 d5 

Gly Tyr Vhr Phe Ala Phe Ser Lya Asn Leu lie Thr Ser -^Phe Asp Gly 
100 105 no 

Ala Vai Gly Tyr Ser Leu Gly Gly Ala Arg Val Glu Leu Glu Ala Ser 
115 130 125 

Tyr Arg Arg Phe Ala Thr Leu Ala Asp Gly Gin Tyr Ala Lys Ser Gly 

130 135 140 

Ala Glu ser Leu Ala Ala lie Thr Arg Asp Ala Asn lie Thr Glu Thr 
145 150 155 ISO 

Asn Tyr Phe val Val Lys He Asp Glu He Thr Asn Thr Ser Val Met 
165 170 175 

Leu Asn Gly Cys Tyr Asp Val Leu Bis Thr Asp Leu Pro Val Ser Pro 
180 IflS 190 

Tyr Val Cys Ala Qly He Gly Ala Ser Phe Val Asp He Ser Lys Gin 
195 200 205 
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• Val Thr Thr Lya Iiett Ala Tyr Arg Gly Lys Val Gly He Ser Tyr Gin 

210 215 220 

Phe Thr Pro Glu lie Ser Leu val Ala Gly Gly pte Tyr His Gly Leu 
225 220 235 240 

Phe Asp Glu Ser oyr I#ys Asp lie Pro Ala His Asn S^r Val Lys Phe 
245 2S0 ass 

Ser Gly Glu Ala Lys Ala Ser Val Lys Ala His He Ala Asp Tyr Gly 
260 265 270 

Phe Asn Leu Gly Ala Arg Phe lieu Phe Ser 
275 2fi0 
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1 K A oonaposition campnsing a polynucleotide which encodes a polypqptide having the 

2 characteristic of clxciting an uninuno re^Kknse protective against disease or death caused by a 

3 rickettsial pathogen. 

1 2. The composition, according to claim I, wherein said rickettsial pathogen is selected 

2 fiom die group consisling otRidusUsia s|ip^ EbrUdm qip., Amplasma spp^ and Cawdria ffpp. 

1 3; The composition, according to claim U wherein said polypq>^ 

2 aequence selected from the group consisting of SEQ ID NO. 2, SEQ ID NO. 4* SEQ ID NO. 6, 

3 SEQ ID NO. 14» SEQ ID NO. IS, SEQ ID NOS. 16-20, SEQ ID NO. ^» and SEQ ID NO. 24, 

4 or a fragment thereof. 

1 4. The conqsx)6ition,aceordmg to claim 1, wherein said polynueleoti^ 

2 add soqucnee selected fmmte group consisting of l, SEQ IDN0.3,SEQIDNO* 

3 S,SEQIDN0.7,SEQIDN0.8,SEQIDNOS.9-13,SEQJDNO. 21. and SEQ ID NO. 22, 

4 . . orafiagmentihaeoE 

1 ' S. 'fhe composition, according to olaim 4, wfaereb said pcriynuoleotide has a 

2 add sequence of SEQ ID NO. 3> or a fragment thereof 



1 6. the coinposition, according to claim 1, wherein said polynucleotide findier 

2 conqirises a nucleic acid vacdne vector. 

1 7. The composition, according to dahn 1, further 

2 ■ a i ccepwM e eanier. r. 



1 
2 
3 



t. Apolynudeoddeencodii^a pcdypqitidelttvbgan amino acid^ 
fiomthC6ioiV60i>sktiqgofSEQIDN0.4,SEQIDNOS. I4-Z0. SEQ1DN0S. 23-24.aiul 
fagiwcnls fliflJBofc 
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1 9. Hic polynucleotide, according to claim 8, said poIyniScleotide having a nucleic acid 

2 sequence selected from the group consisting of S£Q ID NO. 3, S&Q ID NOS. 7-13, and SEQ 

3 ID NOS. 21^22- 

1 10. A method f(»r protecting a sosceptible animal host against disease or death ca^ 

2 by a rickettsial pathogen, said nietbod comprising administering an efTective amount of a 

3 polynucleotide encoding polypeptide having the charactehstic of 

4 protective against said rickettsial pathogen. 

1 11. The method, according to claim 10, wherein said rickettsial pathogen is selected 

2 from the group consisting ofRickeUsia spp.. Ehrlichia spp., Axiaplasma spp., and Cawdria spp. 

1 12. The method, according to claim 10, wherein said polypeptide has an amino acid 

2 sequence selected from the group cansisting of SEQ ID NO. 2, SEQ ID NO. 4, SEQ ID NO. 6, 

3 SEQ ID NO. 14, SEQ ID NO. 15, SEQ ID NOS. 16-20, SEQ ID NO. 23, and SEQ ID NO. 24, 

4 or a fi^^gment thereof. - 

1 13, Hie method, according to daim 10. wherein said polynucleotide has a nucleic acid 

2 sequence selected fromfte group ccmsisting of SEQ ID NO. 1, SEQ ID NO. 3, SEQ ID NO. 5. 

3 SEQ m NO. 7. SEQ mNO. 8, SEQ mNOS. 9-13, SEQ ID NO. 21, and SEQ ID NO. 22. 

1' 14,-The method, aocoidjng to chim 13, whoxm 

2 sequence of SEQ ID NO. 1. 

- I . 15. 71telE^thpd,acG(ndi^gtoc^^ 

2 sequence ofSiEQ ID NO. 3. 

1 16. The method, according to claim 13. wherdn said polynucleotide has the nucleic acid 

2 sequence ofSEQ ID NO. S. 



1 17. The meftod, according to claim 10, wberem^td nuclei 

2 appropriate nucleic acid vector. 



UhU-lU-yb THU u;j:ib m HESKA corporation fax no. 970 221 1126 p. 28 



W098/itiS54 PCt/US97/19(M4 

25 

< 1 IS. The method, accoidtng tt> claim 10« vrfaerein said composition further comprises a ^ 

2 pharmaceuticallyaccqitable earner, 

1 19. A method for detecting, in a human or animal, antibodies associated with infection 

2 by Ehrlichia, wherein said method composes contacting a biological fluid from said human or 

3 animal with a polypeptide selected from the groiq) consisting of SEQ ID NO. 4, SEQ ID NOS. 

4 14-20, SEQ ID NOS. 23-24, and feagments thereof 
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1 g9Aacaaatcea9S9»eacst9taesectaaagegtSC9csaeaecacc«cec9C49cca 

9DLATV7LSVCHFCVBLGGS 
? » ? • 

lei ^gffc r»i*aaar . aaaraaa atLa iji iiyi ■■■■■ n gr^jg rMv^^ gacrqeooqqqaqqqa 

2 ^1 csaactac^ccsgcsacacsssscacsaccacsaicaccaafttaacssgarrinacftcaA 
301 cacetcssg gngM n awirt caacagsgaaaccagsas rnm tt ft a ncr i ite tMft BfcCttc 

OS 

3€1 AcsaaaAacs^estgejg^csgcrccamttgcaggscsaccaetgCTaAcccgixu 

-10 

421 catesacctscjgagtfsaacacgaaccgcgaaaaacrsccsacaacaacsgc^ 

SBS MHCCXP?ZT7Aj;iTL 

481 cCAAtgtgcccsgsacgsggaacatcacssta^ccgagcaearoatgacaaeattaga 

liHSFlPCZSLSOPVQDDIIZS 
541 ggcaacc£::=acacc&gc39«aagcatacgesa&gc9cs£egefttttS99a9Ctst£tet 

(CI flccaaggaa g aattgaaatacaacagczssttgtAt^sggaacagttgcAagatggggfttaoa. 

AKESfiKTfVCVPStSQDirO S 
6^1 tgttgcnataeg sagaaccaccscaagesacagagscacegcc^aa A esa ceea cr saaq 

CVIS8TTLS DIPTVPH Y 5 P JC" 
721 ^ca|^aaacBatetat:et:^aggat:r^gcmggagc^Cwggci:acuaaegg&t9geee& 

YBSNLF5CFACAIGTSJI0GP 

781 agaacag^igecsigaagtatcsTiacgaageacwgasgcsaaaaAteaaggtaacaaetae 

Ri BLEVSTSAFOVKIIDCirilT 
841 aagBBcgaageacatagacaetaCgetetgtsecatctteSeggcaeagagaeafiagaca 

JC NBAH8Y7ALSHLLGTETQI 
901 gac^gcgcaggcsagcgcgcrr^gccssscsMCaaacgaaggaesae^acaaaccacte 

D GAC£A^VPL Z KEG&LDXS P 
981 acgc7gaacgCBcgctac^C9caacaagrgaagscacAec?9tt^::c=£t:a.cacatgc 

K LNAeYDVZSBGZ PPSPTIC 
1021 gcaggcactggtactgaettagtatcga r gt gggaaecta caaa e/K?s aaaagttgcrat 

ACIGZOLVS H PEAXNPIt I S Y 
loai c^aggaaaaccaggecsaagcsacccsacaagcccagaagcssctgtgtvCaCCggCggA 

QGKLGti5YPZSPSA9VrZCO 

nc.^a^^egatB^uAC|o&^^ 

JOl teagegcttgeaggaMaggm^^ 

1281 ^eAMqMflggggaggajtggg^feMg*?^;^7fM^y^y*j^^ 

GISLCCR PMPOL^ 
^1321 acaqt:g gr «aaa ittifl tgta qrnnraii ^ ag^yytffitmoqgqqqa aetaaa^ 
13B1 Atacceet xae CB ccw C TarafyT<aa taaeergacaaaCAcaacagttcaaacaaaggt 
1441 aaacaactgctaaatxtgcgstatgagaacc attqmftmt sttaPBiKaaaaaccaoc^ 

-35 

1501 tg«cG:gtctxcacactgcagttgcsactacrgttaacttatt£fctcactacsc ca^^ 
-10 ass 
1581 ACaegaaesgrnnhnnnfrrrgearnarnnrtgcasaagcageaggaacgtgcettccac 

M HCiCiePP ZT'rALVSXiMffPIP 
1821 ctggaacaceacrgsgt ga rr^'a ac g e aa gg trgacaaea.cgagtggfcaactaeeatgtsa 

GZSPSDPVQGDVXSGAPT Vff 
18B1 gcggcaagcaCacgceaa w;C9Cgc s aea C€ttggcaCgtsggetgeeaaagnagaaaaaa, 

GK YI8P5ASKFGMP9AKSBXB 
1741 acecsaccgccgcactgcasggc^mnaacttflgacggggaagg^tcagctcaecaagtg 

PT V A L I C L K 0 D H E G I S S S S K 
1801 a ca a cgataatya crxeaataacaagg gfcgatfcgaerraaa fcafcuaaa ataaeeeatttt 

WDMHFMNHC r 5 F & T 8 tl ?l P F L 
1881 cagggccsgcaggttgcw^tggttaczcaacgggcg g cccaagagcagagtcsga^gtigt^ 

GFAGAZG7SMGGPBVBPEVS 
1P21 cecattgaaacacscgaGgcsaaaaaccagggcmacaaccacaaaaatgatgetcaeagat • 

YBTFDVICIIQCIIMYXIISAHBI 
1981 aesgtgecscaggtcaacaagacaacageggaataeccaaaaccaggaaatnrgeaetgt 

CALGQQOHSCI PXTSAtVLX. 
ao81 raaaa a gr gaaggatcgcsti^acaccacscatgetaaaegcacgcsacgatataataa 

XSBGLLDZS PMt NA qTPZ XM 
2101 aggagaggataeetssgtg trcgt a c a t atgtgcaggtgttggtXcCBasscaatattcca 

EffZPLSPtZCAGVGTDtZSM 
2181 fcgtteeoMQgeaeaaaCgtfa aaaatgccg ciinrwii qoaM^caggtBfcaaatiteaefcgca 

P£ATIiPKXSYQGXl.GLStSZ 
2221 camiftflcng aagecgccgcacttattggcggaeaccsccacaagg^aeag gaaar^jfln c 

NPSASVPZCGHPHXVZGKBP 
2281 gcagggacattcssacce&gaaageaRcgccaegecaccagccactccagacctagcaa 

EDZPtLKAFV7S 5ATPDZ.AX 



FIG.2A 



UhU-lU-atJ THU 03: IB m HESKA CORPORATION 



FAX NO. 970 221 1126 



wo 98/16554 FCT/USm«M4 

2341 SagcaAcaesaagsgcacgcsat^^cgsaacagaacttggaggaaggccsaacsscCaax: 

yTiSVCK?CIBLC CilFSI?^ 
2A01 tr:£gssac=gcgacacgcriiaaaataatcraaacccate£Sr?trrTr,gccac^graaac 
2461 a aa^T atf^g tfeoaa gaacg t a ric aat a aqaaooQcqQooocoqa ccaaatSgetacss 
2S2X acc»»csss^^csacacsacttacacriinncaaecs9acaaacacaacagcscssaga. 
2S8I a a« a ca aa ca ftcacctaftftcsceccc r i nriin aa« c g attt»t» eagcifgAe »*»^*»>5* «>a 

-3S 

2641 gc gc^c%^gc ggtcrtracaccataqgtesagtattaocttaAcsgag=ssaAegAte^«gmg 

-10 r ■ ^ 

27Q1 y;fc aatatga&ergeaaaaaaerttrr.arAaraaf?rj^riicgaQsaM?g!aAt^ 
BBS 

Z.PGX SPSDAVQUDKVGGK Ft 
2821 tAteagcgggAa&cacg^accaogcgcttcaoifc^cggcgllaCCstcSgct^aaAcaeaa 

IS.GXYVPSVSHFG VF SAK QS 
2861 J* ^ git an cacaAgAafccgg^gca.cacggiir.SAnjkggAft^ gggfl tggcagg ac aa c a tr 

KD.TT Z 0VF6 ]UICQDtf DGSTZ 5 
2941 caaaaattggce^gaaaaiiargatscaacgcaBcaa acsaeccagssaaaCacgaaa acaa. 

CKSPEII T PHVP D Y S F k T g B II 
dobl Seeaccrsca99«%stgc9ig9a9c»;9ccd9c&ace&aiat9a»^l9t9saa9aaca^et 

PPLGP ACAVGrCKCrGPglS L 
3 Q < 1 agaaacgcecracgaaacaccsgi&cgcgaaaaaeeagggcaagaac^caagaacgacgc 

eNSTBTPDVIC il QG NHrcKDJl 
3121 ccacaaacac^cgctc^aacocaCftacagcg3gsgaaagc^aa9Qaas9^99V9B.Baa 

KK.T Y ALTHN£GGlCLSIIJlCdt 
3181 gczrgctr^c^aaaaaacgaaggac tacccgatacaccacczacgssga&tgcacgsta 

PV ptKSZGLliOrS LMLtlACT 
3241 tga tg^aacaag £gaagg a aca c grr^tctsct;!acACa£gc;a^gt;gt^ggtactga 

DVZsecZ PPSPYICACVGTP 
3301 gcsaAcatgca rycstaajggga&a^gsg fcaaaatg^csACgnnflffaaaqrraqgsra 

LI'S M P EA I N PK I SY^QLLO Im 
33€I sagetaCSceacaageccagaagcrscLgccccsgczggcggacaC^SCSaCaaggcgac 

ST. SI SPEASVPVGGKFHKVZ 
3421 agggaatgaacceagagacattccTfgeCiaC^aCikessagtacsTsaaesscsaeagg^aa 

Gtl.BPBD ZPA KXP^TSTLTG tf 
3481 tcaccttaega.tag t aa c aetaagtgt:ai:gccact:t3ggagrggaaercggaggaaggcs 

HP.TI y7X#SVCHPGVSLGGB 7 
3541 taacttccaacsssatsa csgcc a c acgC Maaa a t aatsraaactt0CSStt t« tt c a£gg 

N p: ♦ 

3 €01 crucaqti t a ■■ ^■ ■ ^ ■■ t' wtf ttfgcaiiaaiTita cfltaq e iLa c aaga g pw i uuuu 

34CX sasacaagtag^gcrrcsgscaeecttaCTcatgatacsAtacrsaaecaqcczrsccge 

3721 gat:saecsaegt:gaegMacacataaattCg;scotr,ainiaa>qct aicc^^ 

3782 aaaaat tt*fc»gfcc caacc=occ£Csa»6sagasttcsagtAeagstaacssacssro 
-IB 

3 841 actat t«agt±a cacatgMcg agn aa a aaflCttttc»taaeaaqcgeac£gatt 

BBS IINTK1CVFIT5XLI5L 
3901 acatxszece7ae9Cgsa9Biie=acsztee9accpa<jcasgcagt^£2.s2aa£ggcaat 

X S S L P C V 5 P O P A- C _ 5 C I H C S 
39€1 csccaeatcagcggaaaatacatgcw^gcgescegcatsscggagtarzctctgetaag 

PV I S G KYM P SASH PG VP SA K 
4021 gaagaaagaaatacaacagcsggagcgczcggaecgaagcaaa&Rg^acggaagcgca 

eBBNTTVGVFG LKOBWPGSA 
4081 acatecaacssccccccaaacqatgmttcnctjycctcaa aesacgga zz eaaacafccaa 

Z£HSSPKDVPTV£AT5?K7e 
4141 aacaacTcgecr&taggs^csgeaggagctattggttaeteaa&ggatggtcc^agaata 

B»P FI.CPAGAi c y S K D' O P B J 

4201 gagcttgaagtatettac^aaaeacccgacgtaaaaaaceaaggcaaeaattacaagaat 

Bli EV5lBTF9T<llQ6niirK II. 
4241 9aageae&ta9acattgtg9tetaceceataaefceageft9ea9aeatgagtagt9caagt 

B AHB7C ALSKBSAADM^5 AS 
4321 aacaaccr^gccrtxccaaaaaatgaaggaccaeccgaeata-tcacctacgetgaacgea 

IIBFVPLKBEGLLDZ5FNLRA 
43 Bi tgc ta tgacgcagcaggcgaaggcacacccttt tx^ccutcacacgsgcaggcateggc 

c T : 0 yvcEci ppspticacxg 

4441 actfiacc^aca^cca ccrggcqaagc&acaja fcrcrs aaan ctcctsacaaaggaaagcsa 

TDLVSHF fiAT «PKI57QClCL 
450 1 ggtteaagcnacccmcaagcgcagaagcstccgcgcstAttggcgggcacSg tffac aa g 

GLSYSlSPBASVPrcCKPHK 
45fil gtaatagggaacgaatttagagaCAtccctactaCaacaecCaccggacgaacacctgea 

V IGIieFBOI PTIZPTCS 7LA 
4621 ggaaaaggaaartrawflflegcaBtagtaataccggacgtacgceaccssggaatagaaatg 

CK GNX PAZVSLDVCHFCZBX 
4491 gga 

0 
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1 ^girccre aaasacgaaatacnna minac ^tgcacftgcaacsacacsagSfccgagcaacttte 

IsGXyMPTAffMFGZPSAXSE 
Q5?TICVLVGLD0RI»SH RI219 

101 cccac^ss£agaagtggcaggftgg£a.ccgggaissGaAcaggrimrr.ffMgaa:aga«ct 

3£1 agaag»€caeatgaaatat^ga"iir!rn«nnaeeeagsr»8iiiftft«crjkerziiii«rgBf^ 

SVSNEZPDTKIIPGNVTZ^NOS 
421 zcasaantiatsgegctstacs&^cggAAgtsaeatacgeagsg&cggaa&c&gcggaga 

MKYCALSKG SHiCffDGNSGD 
481 ttgtf^AeaesgnaauaetgatugcicgCiiCCtssigaaaAacgaaggzsc&crcgaegc 

tfrTAJCTDXPVX>LXil£G LLDV 
541 cscatsgacgti M inii rgr a cg gsacga c ft c na c aaccgaaaaaatOgaBsrstgacgt:Sa 

S. FMZ.RACTPXTTEXMPPS PT 
601 ca.ca tgcgcaggs&t^ggtaucrgace £cattacerasgtt:tgagAeaatc acj > aaii e nii«w t 

X CACZOTOLZSKPSTTQNXt 
4£1 aceczatgaaggaamggcagggrsaaag&atacsacaaacrran gagcstcsgsst:SSgc 

£?OGCLGLIIITINSaVSVrA 
721 aggc^esc&ctcacaaggsaaeaggsaacgaactmaggc&ccss&actctacsapG 

GG»FHKV!GNEPX6X P7tLP 
7ai taatggateaancTiCrwiiagtacaaeagcetgcaacagtaaftacsagatgcggacgatgc 

OC SHZieVOQSATVTLDVCK? 
841 eaggscagagaceggaaLgcagatttttettttaacaecsecas^graeaegszaonaaca 

CZ.EtGStPFF» 
901 jcactagcstgetsctgrggctsagaaaegeaagagagaaacagttta g sa ac aaa rtnga 
961 aagctawarnrragaaaagccacacgsstttcatrg^sacsgacac&gnnrTnnMgt'Mg 
1021 ^r.nn atrgrhTfefcaeeaBBaagss^cgcagcacacttaaaC^v— -ara aaagcc»c 
1D81 eatr**'* ^^*^***^**"T"^*****^*W^^J^""^^*"^g^^'^^^''^^ 

-35 ' - 

1141 tMCscaecctcain:gtctj^gf£t£taaacacgaactgcaaaaM 

BBS tt29CXXVPYZ5 

1201 ^AZtg^tMaMmWM£szssstaiecssm^^ 

XLZSSXTPLPHV^Tl^iiPVYC 
1261 aacagcatcytatggtaaccsgsacatBtcaggaaagtac^t:geeaagcgctgctga.st&t 

VSHVGliPyZSGKTMPSV PHF 
1221 ggBBCstccccagetgaagaagagaMaMBiigaeaagcgeagcaeaeggtrraaaagnft . 

GIPSAeEEXXKTTVVtGLKE 
1281 Mecgggeaggag&tgeaBtatctagteaaagteeagacgacaacrtcacc&crcgaaat; 

HHAGDAXSSQSFDDMPrX ftfi 
1441 taeccACgeaagtaeggnngciinraBgccsctaflggc^^gcagcagccaccggctAgtcg 

Y S FKYASRXPLGPAVAI CYS 

15 01 atagge&gceeaagaatagfaagttgag^tgcccsatgaageacrts&cgcaaBBaAteaa 

ZGSPHZElTEtlSTEArDVKNQ 
1561 ggcaacaaCw 
G *r H 
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1 *»tgtat:acattat:a7taa«aatgtCaccgt«t«tattcatAagttaagt»^ 

01 at»ccactctccttcacti:tatcagaagactttrtatttattcacaaactcatgacgtat:ag 

121 W<»caaat:aaaca«ctgcaae!tgcaatcact:acgtaaaactttaactcttcttttt^ 

181 acaactaaaatactaataaaagtaatatagtatuiaaaaatcttaagtaacaieftataat 

-35 

241 attactctgatalAGCaiatgtctagratctctatacrtiaaacgtttatataat tg^ 
-10 ' 

301 tattaATGAAAGCDVrCJ^AArTCAIACrTW 

MKAI KFIlKVCti; FA^AX FXi 

361 XASGetMTCCTAlMTiVC?^^ 

GYS Y IT KOGI FQT KHEDT PH 

421 ATACXACTAXACCAAA3?G7tfW3ACGGmW 

TTZP HSDGZQSSF/SLINQDG 

481 CTAAAACACTAMCAfiCOVAfiATTTariAGGGAAA^^ 

KTVTS QD FLG KHMLVL FCFS 

541 CTC«TGTAAAAGCArrTGCCCTGCAG?«TTGG^^ 

ACKS 1 CPAB1GLVS EALAQ2. 

601 ttgctam:miTGcj^gj\camsiacj^^ 

gkkad k lqvl fi tipfrn dt 

651 CTGIAGAAAAATTAAAAesUCCTTCAffSMiamTrG}^ 

VBK LKBFHBHFDS RIQMITG 

- 721 - ©«A3ACTG?kW5^^ 

II T EDI N 01 I K V y K I Y V O Q A D 

781 AX3AAfiA.TCMaWVTTAACC3OTCTGCAAIAra 

KDHQINHSAIMYLI DKKGSY 

841 ATCT!^^CAa^nrTCATTCC^^ 

L fi H P 1 P B L X 5 Q ° i N Q V " D~ K t L S 

901 w^iA^iGTIWVGCAGaM^ 

961 ataaattcatggaatacgttggatgagtaggttttttttasrtawtttagtffctaa 
1021 attggcat 
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1 g^aaatetcatgtaaacgtgaaatactatattcttttttaaataccaatacaatitgaata 
61 caaaaaaacttttacaacttattatgtttiatcttaaaaccrttactttaagattccttacg 
^ 121 tMcaaaat a aCTaaaatactatttacaaaatiacaccacaattteGatcaaataaaaaaaa 



181 
241 



ctatacaetCCantacactiacagtagatataccataAaagattttaagtaacSZ^^ta 

-35 

atottaemtggtaTaBCMafcgatt^ 

301 ScataaaATG3U«GTTMCAAATTTAIAcTrA?^^ 

M K V I K F I L N I C L L F A -»A 1 F 

361 TClAGGggArTCCTACGPACaAAACft^yggCA:^^ 

L G y S Y V T R Q G I F; Q V R D H H T P 

42i CAftlftCftAM!AIflTaVflftIAAAGCOVGq\TTAC^^ 

WTKXSNKXSITT-S FSLVKQD 

I 

481 TQGMAXMA€£AAAXA6TCM)GAZTrTn 

6N T VH S Q DFL 6 KY M LVjb F 6 F 

341 yiVri^JyPGTRAAAgCATCTgC O Cg G CT G iUira 

S SCXfiXCPABZiOXASB VL SQ 

601 . GCRGGTAATGACACiyGVUIAAGTIDUCA^ 

L6N D T DKLQVX F X T 1 D P T K 1> 

t 

661 WCLiSIPiSXMJ'SCSM^^ 

TV.QKIiXTFBSBrDPRIQMLT 

721 AG6CAGrGCA£S^AraZATTGAM?UUV2AA3^^ 

- - G A- E -1) -1 Z X I -I K N y -K -I Y— V - G -Q A~ 

781 AGAXAAAG9lXAArCMj^<3^TCAC;CT6CX:AT^^ 

DKPKQXDfiSAZMYXIP K X G £ 

841 A£AQ)7TtC7U3U:iTTTCTCX3VraTTX^^ 

Y .1 S K F ,S JP P .I K ^5 2 K Q_V O X L I, 



901 AayegftgAftgAllAAeRMATCrgHaAih*" 

SXIXQYl* ^ ■ 

961 latatoaattcatggatatatgtgatgggtagatttcttttggtgtttctotcgctaatt 
1021 aaatta 
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